EQUATIONS SOLVABLE BY RADICALS 
IN A UNIQUELY DIVISIBLE GROUP 
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Abstract. We study equations in groups G with unique m-th roots for each positive integer 
m. A word equation in two letters is an expression of the form w[X, A) = B, where is a finite 
word in the alphabet {X, A}. We think of A, _B £ G as fixed coefficients, and X £ G as the 
unknown. Certain word equations, such as XAXAX — B, have solutions in terms of radicals: 

X = A-'/^{A'^^BA'/^f^''A~'^^ 

while others such as X^AX = B do not. We obtain the first known infinite families of word 
equations not solvable by radicals, and conjecture a complete classification. 

To a word w we associate a polynomial P„ £ Z[x,y] in two commuting variables, which 
factors whenever ui is a composition of smaller words. We prove that if Pw{x'^,y^) has an 
absolutely irreducible factor in Z[a;,j/], then the equation w{X,A) = B is not solvable in terms 
of radicals. 



1. Introduction 

A group G is called uniquely divisible if for every B G G and each positive integer m, there 
exists a unique X £ G such that 

X™ = B. 

We denote the unique such X by ^1/™, and its inverse by B ^Z*". In the literature, such groups 
are also referred to as Q-groups. Note that if it is not the trivial group, then G must be torsion- 
free, hence infinite. Examples of uniquely divisible groups include the group of positive units 
of a real closed field, unipotent matrix groups, noncommutative power series with unit constant 
term, and the group of characters of a connected Hopf algebra over a field of characteristic 
zero [1]. 

Inspired by trace conjectures in matrix analysis (see section [2D, we study here the natural 
question of which equations in a uniquely divisible group have solutions in terms of radicals. As 
a motivating example, consider the Riccati equation 

XAX = B 

with A,B £ G given and X £ G unknown. This equation has a unique solution X in any 
uniquely divisible group; moreover, its solution may be written explicitly (albeit in two distinct 
ways) as 

X = ^-1/2(^1/2^^1/2)1/2^-1/2 ^ ^1/2(^-1/2^-1^-1/2)1/2^1/2^ (j) 

More generally, let w{X, A) be a finite word in the two- letter alphabet {X, A}. An expression 
of the form 

w{X, A) = B (2) 
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is called a word equation. We are interested in classifying those word equations that have a 
solution in every uniquely divisible group. Clearly, the more general situation in which positive 
rational exponents on X, A, and B are allowed reduces to this one. 

The main tool in our analysis is a new combinatorial object Pu, G Z[2;,y], called the word 
polynomial. If w = A"-'-^ X A°-^ X ■ ■ ■ A'^"-^ X A""" , we define 

P^{x, y) := + xy^o+'^i + x^y'^o+^i+^a + • • • + . 

For a prime p, let (Z/pZ)* denote the set of nonzero elements of the finite field Fp = TLjpTL. The 
following is our main result. 

Theorem 1.1. There exists a uniquely divisible group G with the following property: For all 
finite words w in the alphabet {X,A}, if the equation 

Pv,{x\y^) = Q 

has a solution {xp,yp) G (Z/pZ)* x (Z/pZ)* for all but finitely many primes p, then there exist 
elements A, B, B' G G for which the word equation 

w{X,A) = B 

has no solution X £ G, and the word equation 

w{X,A) = B' 

has at least two solutions X £ G. 

We prove this theorem in section [H The group G is constructed from an infinite collection of 
pg-groups whose orders are chosen using Dirichlet's theorem on primes in arithmetic progressions. 

Despite appearances, Theorem 11.11 yields a computationally efficient sufficient condition for 
the equation w{X,A) = B to have no solution in G (see Corollary II. 7p . To put Theorem II. II in 
context, we next discuss a family of word equations which do have solutions in every uniquely 
divisible group, along with a conjectured complete classification of such words. 

In this paper, "word" will always mean a finite word over the alphabet {X, A} (unless another 
alphabet is specified). The word w is called universal if ([2]) has a solution X £ G for every 
uniquely divisible group G and each two elements A,B £ G; if this solution is always unique, 
then we say that w is uniquely universal. A related class of words is those for which ([2]) has a 
solution "in terms of radicals." This notion is defined carefully in section [3l Our explorations 
give evidence for the surprising conjecture that all three of these classes are in fact the same, 
and can be characterized as follows. 

Definition 1.2. A word w in the alphabet {X, A} is totally decomposable if it is the image of 
the letter X under a composition of maps of the form 

• 7i"m,fc '■ w I-)- {wA'')"'^w, for m > 1, A; > 

• r : w t-^ wA 

• I : w ^ Aw. 

For example, the word w = X AX^ AX AX AX^ AX is totally decomposable, as witnessed 
by the composition w = vri^i o vti^q ° According to the following lemma, any totally 

decomposable word is uniquely universal. 

Lemma 1.3. Let G be a uniquely divisible group, and let w be a totally decomposable word. For 
any A,B £ G, the equation w{X, A) = B has a unique solution X £ G, and this solution can be 
expressed in terms of radicals. 
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For the proof, see section [3l One easily deduces, for example, that the word equation 
X AX^ AX AX AX^ AX = B has a unique solution in every uniquely divisible group: 

( r 1 1/2 ^ 

X = i [a-1/2(^1/25^1/2)1/2^-1/2J ^ ^1/2 I ^-1/2^ 

We conjecture the converse: totally decomposable words are the only universal words. 

Conjecture 1.4. Let w be a finite word in the alphabet {X,A}. The following are equivalent. 

(1) w is totally decomposable. 

(2) w is uniquely universal. 

(3) w is universal. 

(4) w{X,A) = B has a solution in terms of radicals, (see Definition\3.4^ 



The implications (1) =^ (2) =^ (3) and (3) (4) are straightforward (see section [3]). The 
remaining implication (4) =^ (1) is the difficult one. Theorem 11.11 arose out of our attempts 
to prove this implication. It reduces the noncommutative question about word equations to a 
commutative question about solutions to polynomial equations mod p. More concretely, we use 
Theorem 11.11 to prove that several infinite families of word equations are not solvable in terms 
of radicals (CoroUarv ll.Sp . To our knowledge, these are the first such infinite families known. 

Together with Theorem II. 1|, the following would imply Conjecture 11.41 

Conjecture 1.5. Ifw is a word that is not totally decomposable, then the equation ^^(x^,^^) = 

has a solution {xp,yp) € (Z/pZ)* x (Z/pZ)* for all but finitely many primes p. 

The questions outlined above (e.g., asking whether a particular word w is universal, or 
uniquely universal, or solvable in terms of radicals) are examples of decidability questions in 
first-order theories of groups. Determining whether a set of equations has a solution in a group 
is known as the Diophantine problem. More generally, given a set of axioms for a class of groups, 
one would like to provide an algorithm which decides the truth or falsehood of any given sen- 
tence in the theory. That such an algorithm exists for free groups follows from pioneering work 
of Kharlampovich and Myasnikov [19] (see also the independent work of Sela [31] ) , but it is still 
open whether one exists for free uniquely divisible groups F'^. The Diophantine problem for F"^ 
does admit such an algorithm [18j although the time complexity of the algorithm described there 
is likely at least doubly exponential (the proof uses the decidability of Presburger arithmetic). In 
contrast. Conjecture II .41 savs that the Diophantine problem of a single equation in one variable 
reduces to an easily verifiable combinatorial condition (total decomposability) . 

Example 1. Consider the word w = X^AX, which is not totally decomposable; we use Theorem 
ll.ll to show that w is not universal. Its word polynomial is 

Pw{x,y) = l + x + x^y. 

We need to verify that the equation 1 + + (x^y)^ = has a nonzero solution modulo p, for 
all sufficiently large primes p. A standard pigeonhole argument shows that for all primes p, 
there exist a, 6 G Z with a 7^ such that 1 + + 6^ = (mod p). If 6 = and p > 5, then 

1 -|- (—1 -|- 4^-*^)^ + (a + 04"-*^)^ = (mod p) so that we may assume both o, b nonzero. Setting 
X = a and y = ba^"^ gives us our solution. 

The word polynomial for the totally decomposable word v = XAXAX, on the other hand, is 

P^,(x, y) = l + xy + x'^y^. 

Let p be a prime greater than 3. li x,y G (Z/pZ)* satisfy Py{x'^,y'^) = 0, then setting z = x^y^ 
we have = \ and ^ 7^ 1, which forces p = 1 (mod 3). □ 
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Recall that a polynomial over a field K is absolutely irreducible if it remains irreducible 
over every algebraic extension of K. The next result shows that to verify Conjecture 11.51 for a 
particular word w, it suffices to prove that a factor of Pw{x'^,y'^) is absolutely irreducible. 

Proposition 1.6. Suppose F G Z[a;,y] satisfies -F(0, 0) 7^ 0, and F has a factor f £ 'L\x^y\ 
which is irreducible over C[x,y]. Then the equation F{x,y) = has a solution {xp,yp) G 
(Z/pZ)* X (Z/pZ)* for all but finitely many primes p. 

Proof. By a theorem of Ostrowski [30j, the factor f{x,y) is absolutely irreducible modulo all 
sufficiently large primes p (see also [281 Corollary 2.2.10] for a modern perspective from model 
theory). For such a prime p, the homogenization h{x,y,z) £ Z[x,y,z] of / is also absolutely 
irreducible modulo p, and thus defines an irreducible projective curve over ¥p. By results of 
Bach [3] (see also Aubry-Perret [3]), extending a classical theorem of Weil, the curve defined by 
h has at least p — d'^^^p projective points all of whose coordinates lie in ¥p, where d is the degree 
of /. Since F(0, 0) 7^ 0, the polynomial / is not divisible by x or y, so at most 3d of these points 
lie on the x-axis, y-axis, or the line at infinity. It follows that for all sufficiently large primes p, 
there exists a point {xp,yp) G (Z/pZ)* x (Z/pZ)* with f{xp,yp) = 0. □ 

Corollary 1.7. If w is a word in the alphabet {X,A} beginning with X, and if Pii;{x'^,y'^) has 
a factor f G Z[x, y] such that f is irreducible in C[x, y], then w is not universal. 

Example 2. We show the usefulness of Corollary 11.71 by revisiting Example [H The word 
w = X^AX has 

which is irreducible over C (since 1 + is not a square in C(x)). It follows that X^AX is not 
universal. In contrast, the totally decomposable word v = XAXAX has 

P^,(x^y2) = l + x^y^ + xV = (l + xy + x2y2)(l -xy + x^y^). 

Each factor on the right side is irreducible over Z but factors over C. □ 

In Section [71 we use Corollary 11.71 to verify Conjecture 11.51 for the following infinite families 
of words. 

Corollary 1.8. The following families of words do not have their equations solvable in terms 
of radicals: 

m,n> 1, m/n; 
X.4™+2"X^™+"XA™X, m > 0, n > 1; 

XAX'^AX, n > 3; 

X2(AX)"X, n > 2. 

Using Corollary 11.71 and the symbolic computation software Maple, we have also verified 
Conjecture 11.41 for all words of length at most 10. The most difficult part of the computation is 
to check whether a given bivariate polynomial over Z is irreducible over C. This can be done in 
polynomial-time using the algorithm of Gao [TT] (and is implemented in Maple). 

We do not know if the condition in Corollarv 11.71 is sufficient to prove Conjecture 11.41 

Question 1.9. // the word w is not totally decomposable and begins with X, must Pw{x'^,y'^) 
have a factor in Z[x,y] which is irreducible over C[x,y] ? 

The remainder of the paper is organized as follows. In section [21 we give additional motivation 
arising from the BMV trace conjecture in quantum statistical mechanics. In section [3l we review 
the basic properties of uniquely divisible groups and construct a free uniquely divisible group on 
two free generators. This construction allows us to define the notion of solvability in terms of 
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radicals, but it is not needed for the proof of Theorem ll.il Section |4] describes some important 
examples of uniquely divisible groups. Most of the standard examples have the property that 
every word equation with nonnegative exponents has a unique solution; the need to construct 
more exotic groups is part of what makes Conjecture 11.41 so difficult. Section [5] discusses prop- 
erties of the word polynomial Pw , and section [6] is devoted to the proof of Theorem 11.11 These 
two sections form the heart of the paper. Finally, Section [7] contains the proof of Corollarv 11.81 



In this section we explain briefly the connections to matrix theory which inspired our work. 
None of this material will be needed for the sequel, so the reader may safely skip to Section [3] if 
desired. The Lieb-Seiringer formulation |26) of the long-standing Bessis-Moussa-Villani (BMV) 
trace conjecture [ZllMllMlliniEllIIlllllllIlllOllS] in statistical physics says that the trace of 
5'm,fc(^, B), the sum of all words of length m in ^ and B with k Bs, is nonnegative for all n x n 
positive semidefinite matrices A and B. In the case of 2 x 2 matrices, more is true: every word 
in two positive semidefinite letters has nonnegative trace (in fact, nonnegative eigenvalues). It 
was unknown whether such a fact held in general until [17] appeared where it was found (with 
the help of Shaun Fallat) that the word w = BABAAB has negative trace with the positive 
definite matrices 



Finding such examples is surprisingly difficult, and randomly generating millions of matrices 
(from the Wishart distribution) fails to produce them. Nonetheless, it is believed that most words 
can have negative trace, and it was conjectured [17J that if a word has positive trace for every pair 
of real positive definite A and B, then it is a palindrome or a product of two palindromes (the 
converse is well-known). If we replace the words "positive trace" in the previous sentence with 
"positive eigenvalues," we obtain a weaker conjecture which was also studied in Further 
evidence for this conjecture can be found in [15], where it was proved that a generic word has 
positive definite complex Hermitian matrices A and B giving it a nonpositive eigenvalue. 

Positive definite matrices which give a word a negative trace are also potential counterex- 
amples to the BMV conjecture, and it is useful to be able to generate these matrices (see [13^ 
§4.1] and [21 §11] for two such examples). As remarked above, this is difficult since random 
sampling does not seem to work. This discussion explains some of the subtlety of the BMV con- 
jecture: most words occurring in Sm.k{A, B) likely can be made to have negative trace; however, 
a particular word has a small proportion of matrices which witness this. 

Although the set of n x n positive definite matrices is not a group for n > 1, every positive 
definite matrix has a unique positive definite m-th root for any ra. More remarkably, it turns 
out [ini [2j that every word equation w{X,A) = B with w palindromic (and containing at least 
one X) has a positive definite solution X for each pair of positive definite A and B (although 
this solution can be non- unique [2]). Using A\ and it follows that any word of the form 
wAwAAw with w = w{B, A) palindromic (and containing at least one B) can have negative 
trace. This gives an infinite family verifying the conjecture of [T7], and moreover, provides an 
infinite number of potential counterexamples to the BMV conjecture. The existence proof in 
|16| uses fixed point methods, although for special cases (e.g. when w contains four or less ^s), 
one may express solutions X explicitly (and computationally efficiently) in terms of A, B and 
fractional powers [21 §5]. Computing solutions without using these formal representations "in 
terms of radicals" is difficult [21 Remark 11.3], and it is believed that most equations do not 
have solutions expressable in this manner. For instance, there is no known expression for the 
solution to XAX^AX = B although there is always a unique positive definite solution [23] . 




2. Background and Motivation 



Ai 



1 20 210 
20 402 4240 
210 4240 44903 



and Bi 



36501 -3820 190 
-3820 401 -20 
190 -20 1 
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3. Radical words and the free uniquely divisible group 

In this section we review some basic properties of uniquely divisible groups, and construct the 
free uniquely divisible group on two generators. This construction allows us to define precisely 
the notion of "solvability in terms of radicals" (but we emphasize that the proof of Theorem 11.11 
does not rely on this construction). The following lemma shows that rational powers of group 
elements are well-defined and behave as expected. 

Lemma 3.1. Let G be a uniquely divisible group and a & G. Define a"/™" := (a")^/™ for n G Z 
and 7^ m G N, and define a" := 1. Then if p,q £ Q, we have (a^)"? = {a'^)P = a^'^ and 
aPa'i = a'^aP = 0^+" . 

Proof The first claim follows from the observation that (a")^/'" = ((ai/'")mn^)i/m ^ (aV'n)ri. 
To prove the second claim, it suffices to check that a^/*" and a^/^ commute for positive integers 
m and £. Let b = aa^/"^a~^. Then 6*" = a, so a commutes with a^^"^. Now letting c = 
^i/m^i/l^-i/m^ we have = a-^/™aa~^/™ = a, so a^/^ commutes with a^/*". □ 

A detailed study of uniquely divisible groups can be found in the thesis of Baumslag [5] 
where they are called divisible R-groups^ See also [25j for a study of the metabelian case. As 
remarked in [5] , one of the difficulties is that there is no clear normal form for uniquely divisible 
group elements (for example, see ([1]) from the introduction). There is, however, the notion of 
a free uniquely divisible group which comes out of Birkhoff's theory of "varieties of algebras" 
[8]. Since the construction is simple, we briefly outline the main ideas here. Our perspective 
is model-theoretic (see f28] for background) although we will use only basic notions from that 
subject. 

Let T be the first-order theory of uniquely divisible groups. The underlying language and 
axioms of this theory are those of groups, with an additional (countably infinite) set of axioms 
expressing that every element has a unique m-th root for each positive integer m. Consider 
the smallest set S of finite, formal expressions containing letters {A,B}, exponents of the form 
n/m (t^ g 7^ m G N), and balanced parentheses that is closed under taking concatenations 
and powers (and contains the empty expression). For example, S contains the two rightmost 
expressions in ([T]). 

If G is a uniquely divisible group and a,b £ G, then an expression e = e{A, B) £ S defines 
unambiguously (by Lemma 13. ip an element e(a, 6) G G by replacing letters {A, B} with cor- 
responding group elements {a, 6} and then evaluating the result in G. When two expressions 
e, f G S evaluate to the same group element for each pair a,b £ G in every uniquely divisible 
group G, we write e ~ /. For instance, the two rightmost expressions in ([TJ are equivalent in this 
way. Although we will not need it here, Godel's completeness theorem (along with soundness) 
implies that e ~ / if and only if there is a (finite) formal proof from the axioms of T that they 
are equal. 

Note that ~ is an equivalence relation on S, and we write [e] for the equivalence class con- 
taining e £ S. 

Definition 3.2. The set := {[e] : e £ S} with multiplication [e] • [/] = [ef] is called the free 
uniquely divisible group on letters L = {A,B}. 

The definition extends in the obvious way to define the free uniquely divisible group on any 
set L, but (except for a remark at the very end of the paper) we shall only use the case of two 
generators. 

-'^In fact, Baumslag studied a generalization of uniquely divisible groups to those having unique p-th roots for 
all p £ w, where a; is a subset of the primes. We restrict ourselves to the case when u} is the set of all prime 
numbers. 
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The main facts about J- that we wih need are summarized in the following lemma. We remark 
that any homomorphism 'il^ : F ^ G between uniquely divisible groups is easily seen to satisfy 
'[\){cfl) = ij){aY for all a G F and g G Q. 

Lemma 3.3. T is a uniquely divisible group. Moreover, J- satisfies a universal property with 
respect to the map 9 : L ^ sending A i— )• [A] and B i— t- [B] : Given any uniquely divisible group 
G and any map (p : L ^ G, there exists a unique homomorphism (of uniquely divisible groups) 
^ : T ^ G such that %[) o 9 = cf). 

Proof. That the multiplication in Definition 13.21 is well-defined and that T forms a group follow 
directly from the definitions. Moreover, every element [e] G J- has an m-th root, namely [(e)^'''"]. 



If [/] is another m-th root of [e], then for every uniquely divisible group G and all a,b G G, we 
have f{a, 6)™ = e(a, b) and thus /(a, b) = (e(a, 6))i/"^. It follows that [/] = [(e)i/"], and thus J" 



This discussion allows us to formally define the concept of solution in terms of radicals men- 
tioned in the introduction (specifically, in the statement of Conjecture II. 4p . 

Definition 3.4. A word w is called radical (and has equation w{X, A) = B solvable in terms 
of radicals) if the equation w{X, [A]) = [B] has a solution X £ J^. 

We now connect this definition with the idea of word equations having solutions in radicals. 
Let G be a uniquely divisible group. A subgroup R C G is called radical if x"^ G R implies 
X £ R for all X G G and all positive integers m. Given any subset H of G, recursively define 
sets Rn for n > by setting Rq = H and 



We call the union TZ{H) := IJneN radical subgroup of G generated by H. One easily checks 

that TZiH) is a radical subgroup of G and that it is the intersection of all radical subgroups 
containing H. 

For a uniquely divisible group G and a,b £ G, the subgroup Tl{{a,b}) can be thought of as 
the radical expressions generated by a and b. Given a specific instance of the word equation 
w{x, a) = b, any solution x G TZda, b}) can be viewed as one "in terms of radicals." Of course, 
whether a particular word equation in a group has a solution in terms of radicals in this sense 
depends on the group (and the elements a,b G G). However, as the next lemma shows, a 
radical word always has such a solution. Note that this verifies the implication (4) =^ (3) in 
Conjecture 11.41 

Lemma 3.5. Let w be a radical word, and let G be a uniquely divisible group. For any a,b £ G, 
the equation w{x,a) = b has a solution x which lies in the radical subgroup TZ{{a,b}) C G. 

Proof. Let X £ F he a solution to w{X, [A]) = [B]. By Lemma [3.3l there exists a homomorphism 
ip : F ^ G with ipHA]) = a and ^([i?]) = b. Let x = ip{X) £ G and notice that 



Moreover, by the construction in Lemma [3.31 it is straightforward to verify that il^{F) = TZ{{a, b}). 




□ 



Rn+i = {{xy)" : x,y£Rn, q£ Q}. 



i^iwiX, [A])) 
w{^PiX),^Pi[A])) 
w{x, a). 



Thus, we also have x £ 7^({a, 6}). 



□ 
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We close this section with a proof of Lemma ll.31 which shows that (1) =^ (2) in Conjecture ll.41 
As the implications (2) =^ (3) =^ (4) are trivial, the sole unproved implication is (4) =^ (1). 

If u and V are words in the alphabet {X, ^}, we define the composition u o v as the word 
obtained by replacing each occurrence of the letter X in u by the word v. 

Proof of Lemma \1.3l We induct on the number of compositions involved in the word w, the base 
case w = X being trivial. If w is totally decomposable and w ^ X, then w = u o v, where u 
is totally decomposable and v = (j){X). Here (p is one of the elementary morphisms /, r, 'K'^n^k 
in Definition II. 2i By the inductive hypothesis, the equation u{y,a) = b has a unique solution 
y & G, and this solution can be expressed in terms of radicals, i.e., y £ TZ{{a,b}). Hence an 
element x G G solves the word equation w^x, a) = b if and only if (j){x) = y. It therefore suffices 
to show that for any y £ G there is a unique x £ G such that (j){x) = y. The cases (j) = £,r are 
immediate. For the case = 7Tm,k: l^t z = a'^/^xa^/^, so that 



In addition to the free uniquely divisible group encountered in the previous section, there 
are many interesting examples of uniquely divisible groups. We discuss several of them here, 
although this list is far from exhaustive. 

Recall that a real closed field is an ordered field K whose positive elements are squares and 
such that any polynomial of odd degree with coeffients in K has a zero in K. It follows from 
the definition that each positive element of a real closed field has a positive m-th root for every 
positive integer m. Moreover, since the field is ordered, this positive root is unique. The group 
of positive elements of a real closed field is therefore uniquely divisible. 

The rest of our examples are noncommutative. The free group F2 on the alphabet {A, B} 
may be embedded via the Magnus homomorphism (pM [27j into the algebra Q((a, b)) of noncom- 
mutative power series via ^ 1— t- 1 + a and B 1 + b. The image of this map is a subgroup of 
the group D of noncommutative power series with constant term 1. Using the binomial series, 
it can be shown that D is uniquely divisible (see also Proposition 14.11 below for another proof). 
In particular, F2 is a subgroup of a uniquely divisible group. 

Our next result shows that every word equation with coefficients in D has a unique solution 
in that set. However, this solution might not be in the radical subgroup 'R'{4'm{F2)) generated 
by (t)M{F2). 

Proposition 4.1. For any Ai, . . . , Am, B G D, the equation YYlLii^i-^) — ^ unique 
solution X £ D. 

Proof. Write 




□ 



4. Examples of Uniquely Divisible Groups 



1 + 



w£{a,b}* 



B 



1 + 



Pww{a,b), 



uig{a,fe}* 



X 



1 + 




«)G{a,b}* 
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where the sums are over all nonempty finite words in the alphabet {a, b}. Denote the length of 
a word w by Expanding the product 



n (^^^) 



B 



(3) 



i=l 



and equating coefficients, the coefficient of w^a, b) on the left hand side is equal to mxw plus a 
polynomial (5«;(x, a) in the variables x„ for \u\ < \w\ and the variables for \u\ < \w\. Hence 
we obtain for each nonempty word w that 

Assuming, inductively, that has been determined for all \u\ < \w\, this polynomial system 
has the unique solution 



m 



It follows that ([3]) has a unique solution X. 



□ 



A natural problem in this setting is to characterize those power series in D which arise as 
radical expressions of 1 + a and 1 + 6. 

Problem 4.2. Characterize those power series f £ D which are elements of lZ{(j)M{F2))- 

Let ip : J- ^ D he the homomorphism of uniquely divisible groups with = 1 + a and 

■0([-B]) = 1+6 given by Lemma [3. 31 Surprisingly, while the Magnus homomorphism (j)M : F2 ^ D 
is an embedding of groups, it is a very old open question whether ip is also an embedding. As 
far as we know, Baumslag has the best result on this problem [6j, giving injectivity when ip is 
restricted to certain one-relator subgroups of T. 

Our next example is a matrix group. Let K he a field of characteristic and let UT^ he 
the group of n x n unipotent matrices over K. These are the upper triangular matrices with 
coefficients in K with I's along the diagonal. 

Proposition 4.3. For any Ai, . . . , Am, B £ UTn, the equation ]Xh=i (AiX) = B has a unique 
solution X G {7T„. (In particular, UTn is a uniquely divisible group.) 

Proof. Induct on n. For n = 1, the claim is obvious as UTi = {1}. Next for n > 1, let 
Ai, B G UTn+i (i = 1, . . . , m) and write each Ai, B, X in block form 



Ui Ui 


, B = 


' V V ' 


, x = 


' Y 


y 


1 


1 





1 



in which Ui,V G UTn and Ui,v £ K""; and Y £ UTn and y £ K"^ are unknowns. When the 
equation HI^i {-^i^) = B is expanded, it takes the form 



n™ 1 iuY) u + Er=i [(nS {u^Y) ) u, 
1 



y 





' V v' 




1 



(4) 



in which u £ is a function of Ui,Ui, and Y only (and thus does not depend on y). 

By the inductive hypothesis, the equation HI^i i^i^) — ^ ^ unique solution Y £ UTn- 
Let M = YlT=i (nti (^i^)) Uj- Since K has characteristic zero, the diagonal entries of M are 



all equal to m 7^ 0, so M is invertible. The word equation YYlLi i^i^) = B therefore has the 



unique solution X 



Y y 
1 



where y = M ^{v — u). 



□ 
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Remark. The proof of Proposition 14.31 gives a recursive algorithm for finding the unique solu- 
tion X. Moreover, the proof carries over to a slightly larger class of groups. Let Ri,. . . ,Rn be 
radical subgroups of K* , such that for each i = 1, . . . ,n, the additive semigroup generated by 
Ri does not contain 0. Let G be the group of n x n upper-triangular matrices over K whose 
i-th diagonal entry lies in Ri for all i. This group is uniquely divisible, and every word equation 
= B with Ai,B G G has a unique solution X £ G. For example, taking = R with 
i?i = M+ and R2 = {1}, we obtain the group of affine transformations t at + b, where a S M_|_ 
and beR. 

5. The word polynomial 

Given a finite word w over the alphabet {X, A}, write 

w = A'^o XA"^ X • • • A""- 1 XA"" (5) 

for nonnegative integers oq, ■ ■ ■ , ffln- The word polynomial of w is the polynomial in commuting 
variables x and y given by 

n-1 
fc=0 

For example, the word w = X^^^AX has word polynomial 

Pw{x,y) = I + X + x'^ -\ h x""^ + x"-'^y. 

Note that if w ends in X (i.e., an = 0) then w can be uniquely recovered from Py^. 

If u is another word over the same alphabet, the composition uo w is the word obtained by 
replacing each occurrence of the letter X in u by the word w. Although composition of words is 
not commutative, it can be modeled by multiplication of polynomials according to the following 
lemma. 

Lemma 5.1. Let u and w be finite words in the alphabet {X,A} ending with X, and let m,n 
be respectively the number of letters in w equal to A,X. Then 

Puow{x,y) = Pu{x'^y'^,y)Pw{x,y). 

Proof. If w is given by ([5]), write 

u = A'^oXA^^X ■■■ A^^-^X; 
uow = A'^XA'^^X ■ ■ ■ A^^'^-^X. 
Then for < g < m — 1 and 0<r<n — Iwe have 

Qr + bq, r = 

Or-, r > 0. 



Cqn+r 



Since m = + ■ ■ ■ + a„_i, we have 

Co H 1- Cqn+T = bo-\ Vbq + qm + aQ-\ ha,., 

hence 

m— 1 n—1 

Puou,{x,y) = ^ ^a;9"+^'-y^O + -+^+gm+ao + -+a. 
q=Q r=0 

m—1 n—1 
_ rJ.'lnyqm+bo^ hfeq x^y°'°~^ 

q=0 r=0 

= Pu{x^y'^,y)PUx,y). □ 
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Our next lemma shows another context in which the word polynomial arises: from sub- 
stituting certain affine transformation matrices for the letters of w. 

Lemma 5.2. Let x, y, z be commuting indeterminates, let w{X, A) be a finite word, and let m, n 
be respectively the number of letters in w equal to A,X. Then 

Hx zl ly Ol\ ^ fx^y'" Pw{x,y) 
^ ^0 ij ' [o ij y [ 1 

Proof. Note that Pa = 0, Px = 1, and P obeys the recurrence 

Pxw{x,y) = xPy,{x,y) + 1, 
PAw{x,y) = yPw{x,y). 

Here Xw (resp. Aw) denotes the word obtained by prepending the letter X (resp. A) to w. The 
result follows by induction on m + n. □ 

Remark. One can also define a noncommutative word polynomial P^ by taking x, y, z above 
to be noncommuting indeterminates. It is given by 

Pw{x, y) = + + • • • + y'^'xy'^^x ■ ■ ■ xy""-^ . 

However, we will only use the commutative word polynomial P^ in this paper. 

6. Proof of Thoerem 11.11 
We begin with the following elementary fact. 

Lemma 6.1. Let G be a group of order n. If m and n are relatively prime, then every element 
of G has a unique m-th root. 

Proof. There exist r,s ^7L such that rm + sn = 1. One now checks that is the unique m-th 
root oi g. □ 

Let Gi (i = 1,2,.. .) be an infinite sequence of finite groups with the following property: For 
every positive integer m, there exists an iV such that 

m and j^Gi are relatively prime for all i > N. (6) 

By Lemma l6.ll these groups have a limiting kind of unique divisibility, which suggests taking 
the quotient of the direct product of the Gi by their direct sum. 

Lemma 6.2. If Gi,G2, ■ ■ ■ is a sequence of finite groups satisfying then the group 

oo oo 
i=l i=l 

is uniquely divisible. 

Proof. Given g = (gi, g2, ■ ■ ■) ^ Y\Gi, write [g] for its image under the quotient map Y\ Gi — )• G. 
For any g £Y\Gi and any m > 1, by Lemma [6. II and ([6]), there exists N such that for alH > 
the element gi has a unique m-th root hi G Gi. Set hi = 1 for all i < N. Then [/i]™ = [g]. 
Moreover, if /i' G G satisfies [/i']™ = [g], then (/i^)™ = gi for all but finitely many i. By ([6]) it 
follows that h[ = hi for all but finitely many i, so [h'] = [h]. Thus every element [g] € G has a 
unique m-th root in G. □ 
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This lemma allows us to construct many examples of uniquely divisible groups. Next we 
describe the sequence of groups Gi that we will use to prove Theorem II. 1[ 

Let p be an odd prime, and let q = Since the group (Z/pZ)* of units mod p is cyclic of 
order p — 1, we can pick an element t E (Z/pZ)* whose multiplicative order mod p'lsq (namely, 
t can be the square of any generator). The powers of t are exactly the nonzero squares, i.e. 
quadratic residues, in TLjpTL. We take to be the semidirect product (TLjqTl^ x {Ij/plj)^ which 
has the presentation 

Gp = {S,T : 5*r = TS, r« = 1, SP = 1). 

The group Gp can be realized concretely as the group of affine transformations of Z/pZ of 
the form 

z ^ az + b 

where a G (Z/pZ)* is a quadratic residue and b G TLjpTL is arbitrary. Thus we can view Gp as 
the group of all 2 x 2 matrices of the form 

b 

1 

where k ^ Tj/qlj and b G TLjpTL. The generators S and T correspond to the affine transformations 
z I— >■ z + 1 and z I— )■ tz, or the matrices 

S 

Lemma 6.3. Lei a, /3 G {0, . . . , (7 — 1} and 7 G {0, . . . ,p — 1}. For any word w = w{X, A), the 
following identity holds in the group Gpi 

where m and n are respectively the number of A's and X 's in w. 
Proof. Since 



"1 


1' 




't 


0" 






T = 





1 







1 



't^ 7 


rj-ia. 


"t" 0" 


1 




1 



the result follows from setting x = t^ , y = t°^ and z = 7 in Lemma 15.21 □ 

Lemma 6.4. Let w be a finite word in the alphabet {X,A}, and let n be the number of letters 
in w equal to X . Let p be a prime such that q = £^ is relatively prime to n. If the equation 

Pn^ix^y^) = 

has a solution {x, y) G {'Ljp'L)* x iJLjpTL)* , then there exist a,b £ Gp for which the word equation 

w{X, a) = b 

has no solution X G Gp. 

Proof. Suppose (x,y) G {ZjpZ)* x {JLjpTL)* solves Pyj{x'^,y'^) = 0. Since any quadratic residue 
mod p is a power of t, we can find integers a, 5 such that = t^ and y"^ = t". Let 

grpam+Sn 



a = T'', 

where m is the number of letters in w equal to A. By Lemma 
solves the word equation w{X, a) = 6 if and only if 



5n 



an element X = S^T^ G G„ 



(7) 
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Equating powers of T, we obtain f3n = 6n (mod q). Since n and q are relatively prime, it follows 
that (3 = 6 (mod q), and hence = (mod p). Now equating powers of S" in ([7]) yields 

1 = -fPut^n = iput^n = iPUx^y^) = (mod p), 

so there is no solution X to w{X, a) = 6 in Gp. □ 

We may now prove our main result. 

Proof of Theorem \l.l\ Let ttq = 2, vri = 3, . . . be the primes in increasing order. By the Chinese 
remainder theorem, for each i > 1 there is an integer ki satisfying 

ki = Z (mod 4) 

ki = 2 (mod TTj), j = 1, . . . ,i. 

By Dirichlet's theorem on primes in arithmetic progression, for each i there exists a prime pi 
satisfying 

Pi = ki (mod 47ri . . .TTj). 
By construction, ^^^^ is not divisible by any of 2, vri, . . . , vTj. Since 

'tt'^Pi — 2 ' 

the sequence of groups Gp^ , Gp^ , . . . satisfies condition ([61) , so by Lemma 16.21 the quotient group 
is uniquely divisible. 

Now let w be a word in the alphabet {X, and let n be the number of letters in w equal 
to X. Let TTjg be the largest prime divisor of n. For i > we have ^^^^ relatively prime to n. 
By hypothesis, we can choose ii > iq sufficiently large so that the equation P{x^,y^) = has a 
solution {xi,yi) G (Z/piZ)* x (Z/piZ)* for all i > ii. By Lemma [6^ for each i > ii, there exist 
ai,bi G Gp- for which the word equation 

w{X,ai) = hi 

has no solution X G Gp.. Let A.,B ^ G he the images of the sequences (aj)j>j^ and {bi)iyi-^ 
under the quotient map 

n Gp, G. 

i>ii 

The equation w{X,A) = B has no solution X G G. 

Finally, we prove that there exists B' £ G such that w{X, A) = B' has at least two solutions 
X £ G. Using the result just proved, for alH > ii, the map Gp- — )• Gp. sending g ^ w{g,ai) is not 
surjective, hence not injective. Let gi,gi be distinct elements of Gp- such that w{g, Ui) = w{gi,ai), 
and let B' be the image in G of the sequence {w{gi,ai))iyi-^^. Then the images in G of the 
sequences {gi)i>ii and {gi)i>ii are distinct solutions to w{X,A) = B'. □ 

7. Word equations not solvable by radicals 

In this section, we use Corollary 11.71 to find several infinite families of words that are not 
universal, and consequently not solvable in terms of radicals. 

Lemma 7.1. Let m and n be distinct positive integers, and let w = X"^AX"'. Then P^(a;^,y^) 
is irreducible in C[x,y]. 
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Proof. We view the word polynomial 

Pw{x,y) = ^ — — + yx —2 — - 

as a polynomial in y with coefficients in C(x). 

If m < n, then there exists C G C such that C'^™' 7^ 1 and C'^" = 1, in which case is a simple 
pole of f{x) = x~^"^(x^"^ — — 1); likewise, if m > n, then / has a simple root. Thus / 

is not a square in C(x), which implies that Pw{x'^,y'^) is irreducible in C{x)[y]. □ 

By Corollary 11.71 it follows that for positive integers m ^ n, the word equation 

= B 

has no solution in terms of radicals. Our next result shows that for m > and n > 1, the word 
equation 

has no solution in terms of radicals. 

Lemma 7.2. Let m > and n > 1 be integers, and let w = X^^+^nj^^m+n^^mj^^ y/^g^ 
Pw{x^-,y'^) has a factor inZ[x,y] which is irreducible over C[x,y]. 

Proof. The corresponding word polynomial factors as: 

PUx,y) = (1 + + - + x^y^^+Snj, 

The second factor, upon substituting x^ for x and y^ for y, is of the form h{x, y) = 1 + x'^f(y) + 
x^g{y). We claim that h is irreducible over C[x,y]. Let = C(y) be the algebraic closure of 
the field of rational functions in y. The polynomial h factors over K[x] as a product of the four 
linear factors 



X ± 



-f{y) ± 



2 

where D{y) = f{y)'^ — Ag{y). If h had another factorization over C[x,y] = C[t/][x], then it would 
have to further factor into this one (up to units) by unique factorization in In particular, 

this would imply that a proper subset of the above factors combine to give an element of C[3;, y]. 
It is easy to check that this can only happen if D{y) is a perfect square in C[y]. But in fact, 

is not a square (all of its nonzero roots are simple). □ 
The next example shows that the word equation 

XAX'^AX = B 
has no solution in terms of radicals if n > 3. 

Lemma 7.3. Let n > 3 be an integer, and let w = XAX^AX. Then Pw{x'^,y'^) is irreducible 
in C[x, y\. 

Proof. We have 

P.^{x\ y^) = l + {x^+x' + ...+ x2-)y2 + ^2n+2y4_ 

As in the proof of Lemma l7.2( this polynomial factors over C[x, y] only if 

D{x) = (1 + x2 + . . . + x2"-2)2 _ 4a;2«-2 

is a square in C[a;]. Any factor f of D that is irreducible over has distinct roots in C. 
Moreover, the zero sets in C of any two factors of D that are irreducible over Z[x] are either equal 
or disjoint. It follows that if D is a square in C[x], then each irreducible factor over Z[x] divides D 
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with even multiplicity, hence D is a square in TL\x\. But this is impossible, as -D(l) = — 4 is 
not a square in TL for n > 3. □ 

To further extend these families of words not solvable by radicals, note that under the hy- 
potheses of Theorem I l.H we can actually derive a slightly stronger conclusion. 

Corollary 7.4. Lei u,w be finite words in the alphabet {X,A}. If Pw{x'^ ,y'^) = has a solution 
{xp,yp) S (Z/pZ)* X (Z/pZ)* for all but finitely many primes p, then the word equations 

uow{X,A)=B and wou{X,A) = B 

have no solution in terms of radicals. 

Proof. By Lemma [ST] the word polynomial divides Puow, so the equation Puow{x^ ,y^) = 
has a nonzero solution modulo all sufficiently large primes. By Theorem 1 1.!) there exist A,B € G 
for which the equation u o w{X, A) = B has no solution X £ G. 

The statement for wou is immediate from Theorem ll.il since there exist A,BeG for which 
the equation w{X, A) = B has no solution X £ G, the equation wou{X, A) = w{u{X, A), A) = B 
also has no solution X £ G. □ 

A simple substitution also proves the following. 

Corollary 7.5. Let n > 2 be an integer. The word equation 

X'^iAXyX = B 

is not solvable by radicals. 

Proof. The solutions X to X'^{AX)'^X = B are in direct correspondence with solutions Y 
to YGY"+^CY = D via the relationships Y = A^/^XA^/^, G = A-\ D = A^/'^BA^/'^. In 
particular, if there are no solutions to the latter equation in a uniquely divisible group G for 
some G,D £ G, then there will be none to the former with A = G~^ and B = A~^I'^DA^'^I'^. □ 

As a final remark, one might ask about more general classes of word equations in uniquely 
divisible groups, such as equations of the form 

X'^MiX^Ma • • • = B (8) 

for positive integers k and no, . . . However, these equations almost never have solutions in 
terms of radicals. 

Corollary 7.6. The word equation (0j has a solution X in terms of radicals if and only if k = 1 
and uq = ni. 

Proof. Suppose first that k > 1. By setting A^ = . . . = A^ = 1, any solution in terms of radicals 
to dHj) yields a solution to 

X'^MiX^MaX'"^ = B 

in terms of radicals, where m2 = n2 + ■ ■ ■ + n^. If ng + ni 7^ m2, then set Ai = 1; otherwise, 
set A2 = 1. In either case we obtain a solution in terms of radicals to an equation of the form 
X^AX"^ = B with m ^ n, which is impossible by Lemma l7.ll 

In the case k = 1, if ng ^ ni then there is no solution by radicals by Lemma l7.ll while if 
1^0 = ni = n, then the word w = X^AX^ is totally decomposable, so the equation X^AX^ = B 
has a solution by radicals by Lemma 11.31 □ 
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